Page 1 of 8 





RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/716, 147 



DATE: 05/20/2003 
TIME: 07:51:00 



Input Set : N:\Crf3\RULE60\09716147.RAW.txt 
Output Set: N:\CRF4\05202003\l716147.raw 



SEQUENCE LISTING 

(1) GENERAL INFORMATION: 







o 
z 




APPLICANT: Hinuma, Shuji 












Habata, Yugo 






fi 






Kawamata, Yuji 






c 
D 






Hosoya, Masaki 






a 
o 






Fujii, Ryo 






1 






Fukusumi, Shoji 






Q 

o 






Kitada, Chieko 






Q 

y 


/ A i \ 

(11 ) 


TITLE OF INVENTION: POLYPROTEINS, THEIR 






1U 


(111 J 


NUMBER OF SEQUENCES: 140 






i i 


(iv; 


CORRESPONDENCE ADDRESS: 










(A) 


ADDRESSEE: DIKE, BRONSTEIN, ROBERTS 










(B) 


STREET: 130 Water Street 






1 A 




(C) 


CITY: Boston 






1 0 




(D) 


STATE: MA 






± D 




(E) 


COUNTRY: USA 






± / 




(F) 


ZIP: 02109 






1 ft 


\ v ; 


COMPUTER READABLE FORM: 










(A) 


MEDIUM TYPE: Diskette 










(B) a 


COMPUTER: IBM compatible 






21 




(C) 


OPERATING SYSTEM: DOS 






22 




(D) 


SOFTWARE: FastSEQ for Windows Versi 






23 


(vi) 


CURRENT APPLICATION DATA: 


c- 


-> 


24 




(A) 


APPLICATION NUMBER: US/09/716,147 


c- 


-> 


25 




(B) 


FILING DATE: 17-Nov-2000 






26 




(C) 


CLASSIFICATION: 






27 


(vii) 


PRIOR APPLICATION DATA: 


w- 


-> 


28 




(A) 


APPLICATION NUMBER: US/08/776 , 971B 






29 




(B) 


FILING DATE: 06-Feb-1997 


w- 


-> 


30 




(A) 


APPLICATION NUMBER: PCT/ JP96/03821 






31 




(B) 


FILING DATE: 28-DEC-1996 


w- 


-> 


32 




(A) 


APPLICATION NUMBER: JP 7/343371 






33 




(B) 


FILING DATE: 28-DEC-1995 


w- 


-> 


34 




(A) 


APPLICATION NUMBER: JP 8/59419 






35 




(B) 


FILING DATE: 15-MAR-1996 


w- 


-> 


36 




(A) 


APPLICATION NUMBER: JP 8/211805 






37 




(B) 


FILING DATE: 12-AUG-1996 


w- 


-> 


38 




(A) 


APPLICATION NUMBER: JP 8/246573 






39 




(B) 


FILING DATE: 18-SEP-1996 






40 


(viii) 


ATTORNEY/AGENT INFORMATION : 






41 




(A) 


NAME: Conlin, David G. 






42 




(B) 


REGISTRATION NUMBER: 27,026 



& CUSHMAN, LLP 



ENTERS 0 



2.0 
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43 (C) REFERENCE /DOCKET NUMBER: 47176 

4 4 (ix) TELECOMMUNICATION INFORMATION: 

45 (A) TELEPHONE: 617-523-3400 

46 (B) TELEFAX: 617-523-6440 

47 (2) INFORMATION FOR SEQ ID NO: 1: 

4 8 (i) SEQUENCE CHARACTERISTICS: 

49 (A) LENGTH: 98 amino acids 

50 (B) TYPE: amino acid 

51 (C) STRANDEDNESS: single 

52 (D) TOPOLOGY: linear 

53 (ii) MOLECULE TYPE: protein 

54 (v) FRAGMENT TYPE: internal 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

56 Met Lys Ala Val Gly Ala Trp Leu Leu Cys Leu Leu Leu Leu Gly Leu 

57 1 5 10 15 

58 Ala Leu Gin Gly Ala Ala Ser Arg Ala His Gin His Ser Met Glu He 

59 20 
60 



25 30 
Arq Thr Pro Asp He Asn Pro Ala Trp Tyr Ala Gly Arg Gly lie Arg 

61 35 40 45 

62 Pro Val Gl* --- 

63 50 55 • 60 

64 . 7Q 

65 Ala Glu Pro Ser Arg Ala Leu Pro Gly Arg Leu Thr Ala Gin Leu Val 

85 90 95 



Pro Val Gly Arg Phe Gly Arg Arg Arg Ala Ala Pro Gly Asp Gly Pro 

50 55 * 60 

Arg Pro Gly Pro Arg Arg Val Pro Ala Cys Phe Arg Leu Glu Gly Gly 



66 
67 

68 Gin Glu 

70 (2) INFORMATION FOR SEQ ID NO: 2: 

71 (i) SEQUENCE CHARACTERISTICS: 

72 (A) LENGTH: 294 base pairs 

73 (B) TYPE: nucleic acid 

74 (C) STRANDEDNESS: double 

75 (D) TOPOLOGY: linear 
7 6 (ii) MOLECULE TYPE: cDNA 

77 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: ^^nnnn £0 

78 ATGAAGGCGG TGGGGGCCTG GCTCCTCTGC CTGCTGCTGC TGGGCCTGGC CCTGCAGGGG 60 
ll GCTGCCAGCA GAGCCCACCA GCACTCCATG GAGATCCGCA CCCCCGACAT CAACCCTGCC 120 
80 tggScgcrg GCCGTGGGAT CCGGCCCGTG GGCCGCTTCG GCCGGCGAAG AGCTGCCCYG 180 
8? GGGGACGGAC CCAGGCCTGG CCCCCGGCGT GTGCCGGCCT GCTTCCGCCT GGAAGGCGGY 240 
82 GCTGAGCCCT CCCGAGCCCT CCCGGGGCGG CTGACGGCCC AGCTGGTCCA GGAA 294 

84 (2) INFORMATION FOR SEQ ID NO: 3: 

85 (i) SEQUENCE CHARACTERISTICS: 

86 (A) LENGTH: 29 amino acids 

87 (B) TYPE: amino acid 

88 (C) STRANDEDNESS: single 

89 (D) TOPOLOGY: linear 

90 (ii) MOLECULE TYPE: protein 

91 (v) FRAGMENT TYPE: internal 

92 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

93 Ser Arg Ala His Gin His Ser Met Glu lie Arg Thr Pro Asp lie Asn 
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94 1 5 10 15 

95 Pro Ala Trp Tyr Ala Gly Arg Gly He Arg Pro Val Gly 

96 20 25 

98 (2) INFORMATION FOR SEQ ID NO: 4: 

99 (i) SEQUENCE CHARACTERISTICS: 

100 (A) LENGTH: 19 amino acids 

101 (B) TYPE: amino acid 

102 (C) STRANDEDNESS: single 

103 (D) TOPOLOGY: linear 

104 (ii) MOLECULE TYPE: protein 

105 (v) FRAGMENT TYPE: internal 

106 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

107 Thr Pro Asp He Asn Pro Ala Trp Tyr Ala Gly Arg Gly He Arg Pro 

108 15 10 15 

109 Val Gly Arg 

111 (2) INFORMATION FOR SEQ ID NO: 5: 

112 (i) SEQUENCE CHARACTERISTICS: 

113 (A) LENGTH: 31 amino acids 

114 (B) TYPE: amino acid 

115 (C) STRANDEDNESS: single 

116 (D) TOPOLOGY: linear 

117 (ii) MOLECULE TYPE: protein 

118 (v) FRAGMENT TYPE: internal 

119 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

120 Ser Arg Ala His Gin His Ser Met Glu He Arg Thr Pro Asp He Asn 

121 15 10 15 

122 Pro Ala Trp Tyr Ala Gly Arg Gly He Arg Pro Val Gly Arg Phe 

123 20 25 30 

125 (2) INFORMATION FOR SEQ ID NO: 6: 

126 (i) SEQUENCE CHARACTERISTICS: 

127 (A) LENGTH: 32 amino acids 

128 (B) TYPE: amino acid 

129 (C) STRANDEDNESS: single 

130 (D) TOPOLOGY: linear 

131 (ii) MOLECULE TYPE: protein 

132 (v) FRAGMENT TYPE: internal 

133 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

134 Ser Arg Ala His Gin His Ser Met Glu He Arg Thr Pro Asp He Asn 

135 15 10 15 

136 Pro Ala Trp Tyr Ala Gly Arg Gly He Arg Pro Val Gly Arg Phe Gly 

137 20 25 30 

139 (2) INFORMATION FOR SEQ ID NO: 7: 

140 (i) SEQUENCE CHARACTERISTICS: 

141 (A) LENGTH: 33 amino acids 

142 (B) TYPE: amino acid 

143 (C) STRANDEDNESS: single 

144 (D) TOPOLOGY: linear 

145 (ii) MOLECULE TYPE: protein 
14 6 (v) FRAGMENT TYPE: internal 
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147 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

148 Ser Arg Ala His Gin His Ser Met Glu He Arg Thr Pro Asp He Asn 

149 15 10 15 

150 Pro Ala Trp Tyr Ala Gly Arg Gly He Arg Pro Val Gly Arg Phe Gly 

151 20 25 30 

152 Arg 

154 (2) INFORMATION FOR SEQ ID NO: 8: 

155 (i) SEQUENCE CHARACTERISTICS: 

156 (A) LENGTH: 20 amino acids 

157 (B) TYPE: amino acid 

158 (C) STRANDEDNESS: single 

159 (D) TOPOLOGY: linear 

160 (ii) MOLECULE TYPE: protein 

161 (v) FRAGMENT TYPE: internal 

162 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

163 Thr Pro Asp He Asn Pro Ala Trp Tyr Ala Gly Arg Gly He Arg Pro 

164 15 10 15 

165 Val Gly Arg Phe 

166 20 

168 (2) INFORMATION FOR SEQ ID NO: 9: 

169 (i) SEQUENCE CHARACTERISTICS: 

170 (A) LENGTH: 21 amino acids 

171 (B) TYPE: amino acid 

172 (C) STRANDEDNESS: single 

173 (D) TOPOLOGY: linear 

174 (ii) MOLECULE TYPE: protein 

175 (v) FRAGMENT TYPE: internal 

17 6 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

177 Thr Pro Asp He Asn Pro Ala Trp Tyr Ala Gly Arg Gly He Arg Pro 

178 15 10 15 

179 Val Gly Arg Phe Gly 

180 20 

182 (2) INFORMATION FOR SEQ ID NO: 10: 

183 (i) SEQUENCE CHARACTERISTICS: 

184 (A) LENGTH: 22 amino acids 

185 (B) TYPE: amino acid 

186 (C) STRANDEDNESS: single 

187 (D) TOPOLOGY: linear 

188 (ii) MOLECULE TYPE: protein 

189 (v) FRAGMENT TYPE: internal 

190 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

191 Thr Pro Asp He Asn Pro Ala Trp Tyr Ala Gly Arg Gly He Arg Pro 

192 15 10 15 

193 Val Gly Arg Phe Gly Arg 

194 20 

196 (2) INFORMATION FOR SEQ ID NO: 11: 

197 (i) SEQUENCE CHARACTERISTICS: 

198 (A) LENGTH: 87 base pairs 

199 (B) TYPE: nucleic acid 
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200 (C) STRANDEDNESS: double 

201 (D) TOPOLOGY: linear 

202 (ii) MOLECULE TYPE: cDNA 

11] <«jj^£5^^^>^ ccc TGCCIGGTSC S 

205 GCRGGCCGTG GGATCCGGCC CGTGGGC 

207 (2) INFORMATION FOR SEQ ID NO: 12: 

208 (D SEQUENCE CHARACTERISTICS: 

209 (A) LENGTH: 57 base pairs 

210 (B) TYPE: nucleic acid 
2n (C) STRANDEDNESS: double 

212 (D) TOPOLOGY: linear 

213 (ii) MOLECULE TYPE: cDNA 

l\i 'l^xS^^^^^'Si^ —COT cccccc 

217 (2) INFORMATION FOR SEQ ID NO: 13: 

218 (i) SEQUENCE CHARACTERISTICS: 

219 (A) LENGTH: 93 base pairs 

220 (B) TYPE: nucleic acid 

221 (C) STRANDEDNESS: double 

222 (D) TOPOLOGY: linear 

223 (ii) MOLECULE TYPE: cDNA 

HI *™ TGCCTGGTAC «0 

226 GCRGGCCGTG GGATCCGGCC CGTGGGCCGC TTC 

228 (2) INFORMATION FOR SEQ ID NO: 14: 

229 (i) SEQUENCE CHARACTERISTICS: 

23 0 (A) LENGTH: 96 base pairs 

231 (B) TYPE: nucleic acid 

232 (C) STRANDEDNESS: double 

233 (D) TOPOLOGY: linear 

234 (ii) MOLECULE TYPE: cDNA 

HI '^^^^^^^ »™CC TG~C 60 

237 GCRGGCCGTG GGATCCGGCC CGTGGGCCGC TTCGGC 

239 (2) INFORMATION FOR SEQ ID NO: 15: 

240 (i) SEQUENCE CHARACTERISTICS: 

241 (A) LENGTH: 99 base pairs 

242 (B) TYPE: nucleic acid 

243 (C) STRANDEDNESS: double 

244 (D) TOPOLOGY: linear 

245 (ii) MOLECULE TYPE: cDNA 

1% ^^^^sss^s^^ »™ mnc tl 

248 GCRGGCCGTG GGATCCGGCC CGTGGGCCGC TTCGGCCGG 

250 (2) INFORMATION FOR SEQ ID NO: 16: 

251 (i) SEQUENCE CHARACTERISTICS: 

252 (A) LENGTH: 60 base pairs 

253 (B) TYPE: nucleic acid 
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Please Note: . . , <n ^ Seauence Listing. Please review the 

Seq#:29; N Pos . 20 
Seq#:30; N Pos. 3,12 
Seq#:31; N Pos. 15,16 
Seq#:32; N Pos. 23 
Seq#:35; N Pos. 3 
Seq#:36; N Pos. 3,9,18,24 
Seq#:37; N Pos. 15,21 
Seq#:73; Xaa Pos . 10 , 11 , 21 , 22 
Se q#:74; Xaa Pos.3,5,10 
Seq#:H4; Xaa Pos. 43, 80 
Seq#:H5; Xaa Pos. 43, 80 
Seq#:H6; Xaa Pos. 43, 80 
Seq#:H7; Xaa Pos. 43, 80 
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PATENT APPLICATION: US/09/716,147 
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L:24 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER: ] 
L-25 M-220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 

L-30 M : 238 W: Alpha Fields not Ordered, Reordered [(A) APPLICATION NUMBER: ] of (l)(vn 
L : 32 M-238 W: Alpha Fields not Ordered, Reordered [(A) APPLICATION NUMBER:] of (1) (vii) 
L : 34 M : 238 W Alpha Fields not Ordered, Reordered [(A) APPLICATION NUMBER:] of (l)(vxi) 
l"-36 M : 238 W Alpha Fields not Ordered, Reordered [(A) APPLICATION NUMBER:] of (D(vn) 
l!38 m!238 W: Alpha Fields not Ordered, Reordered [(A) APPLICATION NUMBER:] of d)(vn] 
L:1140 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:73 after pos . : 0 
M:341 Repeated in SeqNo=73 

L:1166 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:74 after pos . : 0 
L:1990 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:114 after pos.: 146 
M:341 Repeated in SeqNo=114 

L:2022 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:115 after pos.:32 
M:341 Repeated in SeqNo=115 

L:2050 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:116 after pos.:146 
M:341 Repeated in SeqNo=116 

L:2082 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:117 after pos.:32 
M:341 Repeated in SeqNo=117 
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